STIC Biotechnology Systems Branch 

RAW SEQUENCE LISTING 
ERROR REPORT 



The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 

Application Serial Number: 

Source: ^LP1j3P . 

Date Processed by STIC : ®57///%&®~L 

THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION AND PATENTIN SOFTWARE QUESTIONS, PLEASE CONTACT 
MARK SPENCER, TELEPHONE: 571-272-2510; FAX: 571-273-0221 



TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 4.4.0 PROGRAM, ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW FOR ADDRESS: 
http://www.uspto.gov/web/offices/pac/checker/chkrnote.htm 



Applicants submitting genetic sequence information electronically on diskette or CD-ROM should be aware that there is 
a possibility that the disk/CD-ROM may have been affected by treatment given to all incoming mail. 

Please consider using alternate methods of submission for the disk/CD-ROM or replacement disk/CD-ROM. 

Any reply including a sequence listing in electronic form should NOT be sent to the 20231 zip code address for the 
United States Patent and Trademark Office, and instead should be sent via the following to the indicated addresses: 

1. EFS-Bio (<http://www.uspto.gov/ebc/efs/downloads/documents.htm>, EFS Submission 
User Manual - ePAVE) 

2. U.S. Postal Service: Commissioner for Patents, P.O. Box 1450, Alexandria, VA 22313-1450 

3. Hand Carry, Federal Express, United Parcel Service, or other delivery service (EFFECTIVE 01/14/05): 

U.S. Patent and Trademark Office, Mail Stop Sequence, Customer Window, Randolph Building, 401 Dulany Street, 
Alexandria, VA 22314 
RAW SEQUENCE LISTING 

PATENT APPLICATION: US/10/577,893 



DATE: 05/11/2006 
TIME: 11:07:16 
Does Not Comply 
Corrected Diskette Needed 



= threonine or serine 



Input Set : A:\21564Y SEQ 05 01 06. TXT 
Output Set: N:\CRF4\05112006\J577893.raw 

<110> APPLICANT: Merck & Co., Inc. 

Istituto di Ricerche di Biologia Molecolare P. Angeletti S.p.A. 
<120> TITLE OF INVENTION: HCV REPLICONS CONTAINING NS5B FROM 

GENOTYPE 2B 
<130> FILE REFERENCE: 21564Y PCT 

<140> CURRENT APPLICATION NUMBER: US/10/577,893 
<141> CURRENT FILING DATE: 2006-05-01 

<150> PRIOR APPLICATION NUMBER: 60/517,605 
<151> PRIOR FILING DATE: 2003-11-05 
<160> NUMBER OF SEQ ID NOS: 28 

<170> SOFTWARE: FastSEQ for Windows Version 4.0 
<210> SEQ ID NO: 1 
<211> LENGTH: 591 
<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: modified NS5B 
<221> NAME/KEY: VARIANT 
<222> LOCATION: (5)...(5) 
<223> OTHER INFORMATION: Xaa 
<221> VARIANT 

<222> LOCATION: (24)...(24) 
<223> OTHER INFORMATION: Xaa 
<221> VARIANT 

<222> LOCATION: (31)...(31) 
<223> OTHER INFORMATION: Xaa 
<221> VARIANT 

<222> LOCATION: ^ 

<22 3> OTHER INFORMATION : rf[£t 
<400> 1 

Ser Met Ser Tyr Xaa Trp Thr Gly Ala Leu Ile Thr Pro Cys Gly Pro 

1 5 10 15 

Glu Glu Glu Lys Leu Pro Ile Xaa Pro Leu Ser Asn Ser Leu Xaa Arg 

20 25 30 

Phe His Asn Lys Val Tyr Ser Thr Thr Ser Arg Ser Ala Ser Leu Arg 

35 40 45 

Ala Lys Lys Val Thr Phe Asp Arg Val Gln Val Leu Asp Ala His Tyr 

50 55 60 

Asp Ser Val Leu Gln Asp Val Lys Arg Ala Ala Ser Lys Val Ser Ala 
65 70 75 80 

Arg Leu Leu Thr Val Glu Glu Ala Cys Ala Leu Thr Pro Pro His Ser 

85 90 95 

Ala Lys Ser Arg Tyr Gly Phe Gly Ala Lys Glu Val Arg Ser Leu Ser 



= asparagine or serine 



= methionine or isoleucine 




= isoleucine or leucine 
57 100 105 110 

58 Arg Arg Ala Val Asn His Ile Arg Ser Val Trp Glu Asp Leu Leu Glu 

59 115 120 125 

60 Asp Gln His Thr Pro Ile Asp Thr Thr Ile Met Ala Lys Asn Glu Val 

61 130 135 140 

62 Phe Cys Ile Asp Pro Thr Lys Gly Gly Lys Lys Pro Ala Arg Leu Ile 

63 145 150 155 160 

64 Val Tyr Pro Asp Leu Gly Val Arg Val Cys Glu Lys Met Ala Leu Tyr 

65 165 170 175 

66 Asp Ile Ala Gln Lys Leu Pro Lys Ala Ile Met Gly Pro Ser Tyr Gly 

67 180 185 190 

68 Phe Gln Tyr Ser Pro Ala Glu Arg Val Asp Phe Leu Leu Lys Ala Trp 

69 195 200 205 

70 Gly Ser Lys Lys Asp Pro Met Gly Phe Ser Tyr Asp Thr Arg Cys Phe 

71 210 215 220 

72 Asp Ser Thr Val Thr Glu Arg Asp Ile Arg Thr Glu Glu Ser Ile Tyr 

73 225 230 235 240 

74 Gln Ala Cys Ser Leu Pro Gln Glu Ala Arg Thr Val Ile His Ser Leu 

75 245 250 255 

76 Thr Glu Arg Leu Tyr Val Gly Gly Pro Met Thr Asn Ser Lys Gly Gln 

77 260 265 270 

78 Ser Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Phe Thr Thr Ser 

79 275 280 285 

80 Met Gly Asn Thr Met Thr Cys Tyr Ile Lys Ala Leu Ala Ala Cys Lys 

81 290 295 300 

82 Ala Ala Gly Ile Val Asp Pro Val Met Leu Val Cys Gly Asp Asp Leu 

83 305 310 315 320 

84 Val Val Ile Ser Glu Ser Gln Gly Asn Glu Glu Asp Glu Arg Asn Leu 

85 325 330 335 

86 Arg Ala Phe Thr Glu Ala Met Thr Arg Tyr Ser Ala Pro Pro Gly Asp 

87 340 345 350 

88 Leu Pro Arg Pro Glu Tyr Asp Leu Glu Leu Ile Thr Ser Cys Ser Ser 

89 355 360 365 

90 Asn Val Ser Val Ala Leu Asp Ser Arg Gly Arg Arg Arg Tyr Phe Leu 

91 370 375 380 

92 Thr Arg Asp Pro Thr Thr Pro Xaa Thr Arg Ala Ala Trp Glu Thr Val 

93 385 390 395 400 

94 Arg His Ser Pro Val Asn Ser Trp Leu Gly Asn Ile Ile Gln Tyr Ala 

95 405 410 415 

96 Pro Thr Ile Trp Val Arg Met Val Ile Met Thr His Phe Phe Ser Ile 

97 420 425 430 

98 Leu Leu Ala Gln Asp Thr Leu Asn Gln Asn Leu Asn Phe Glu Met Tyr 

99 435 440 445 

100 Gly Ala Val Tyr Ser Val Asn Pro Leu Asp Leu Pro Ala Ile Ile Glu 

101 450 455 460 

102 Arg Leu His Gly Leu Glu Ala Phe Ser Leu His Thr Tyr Ser Pro His 

103 465 470 475 480 

104 Glu Leu Ser Arg Val Ala Ala Thr Leu Arg Lys Leu Gly Ala Pro Pro 

105 485 490 495 




106 Leu Arg Ala Trp Lys Ser Arg Ala Arg Ala Val Arg Ala Ser Leu Ile 

107 500 505 510 

108 Ala Gln Gly Ala Arg Ala Ala Ile Cys Gly Arg Tyr Leu Phe Asn Trp 

109 515 520 525 

110 Ala Val Lys Thr Lys Leu Lys Leu Thr Pro Leu Pro Glu Ala Ser Arg 

111 530 535 540 

112 Leu Asp Leu Ser Gly Trp Phe Thr Val Gly Ala Gly Gly Gly Asp Ile 

113 545 550 555 560 

114 Tyr His Ser Val Ser His Ala Arg Pro Arg Leu Leu Leu Leu Cys Leu 

115 565 570 575 

116 Leu Leu Leu Ser Val Gly Val Gly Ile Phe Leu Leu Pro Asp Arg 

117 580 585 590 

120 <210> SEQ ID NO: 2 

121 <211> LENGTH: 1776 

122 <212> TYPE: DNA 

123 <213> ORGANISM: Artificial Sequence 

125 <220> FEATURE: 

126 <223> OTHER INFORMATION: modified NS5B 
W--> 128 <221> NAME/KEY: variation 

129 <222> LOCATION: (3)...(3) 

130 <223> OTHER INFORMATION: n = A or T 
W--> 132 <221> variation 

133 <222> LOCATION: (9)...(9) 

134 <223> OTHER INFORMATION: n = C or A 
W--> 136 <221> variation 

137 <222> LOCATION: (13)...(13) 

138 <223> OTHER INFORMATION: n = A or T 
W--> 140 <221> variation 

141 <222> LOCATION: (15)...(15) 

142 <223> OTHER INFORMATION: n = A or C 
W--> 144 <221> variation 

145 <222> LOCATION: (21)...(21) 

146 <223> OTHER INFORMATION: n = A or G 
W--> 148 <221> variation 

149 <222> LOCATION: (24)...(24) 

150 <223> OTHER INFORMATION: n = C or G 
W--> 152 <221> variation 

153 <222> LOCATION: (28)...(28) 

154 <223> OTHER INFORMATION: n = T or C 
W--> 156 <221> modified_base 

157 <222> LOCATION: (30)...(30) 

158 <223> OTHER INFORMATION: n = G or C 
W--> 160 <221> variation 

161 <222> LOCATION: (33)...(33) 

162 <223> OTHER INFORMATION: n = C or A 
W--> 164 <221> variation 

165 <222> LOCATION: (71)...(71) 

166 <223> OTHER INFORMATION: n = A or G 
W--> 168 <221> variation 

169 <222> LOCATION: (83)...(83) 

170 <223> OTHER INFORMATION: n = G or T 
W--> 172 <221> variation 

173 <222> LOCATION: (1174)...(1174) 

174 <223> OTHER INFORMATION: n = A or C 
W--> 176 <400> 2 

W--> 177 tcnatgtcnt acncntggac nggngccntn atnacaccat gtgggcccga agaggagaag 60 
W--> 178 ttaccgatca nccctctgag taattcgctc atncggttcc ataataaggt gtactccaca 120 

179 acctcgagga gtgcctctct gagggcaaag aaggtgactt ttgacagggt gcaggtgctg 180 

180 gacgcacact atgactcagt cttgcaggac gttaagcggg ccgcctctaa ggttagtgcg 240 

181 aggctcctca cggtagagga agcctgcgcg ctgaccccgc cccactccgc caaatcgcga 300 

182 tacggatttg gggcaaaaga ggtgcgcagc ttatctagga gggccgttaa ccacatccgg 360 

183 tccgtgtggg aggacctcct ggaagaccaa cataccccaa ttgacacaac tatcatggct 420 

184 aaaaatgagg tgttctgcat tgatccaact aaaggtggga aaaagccagc tcgcctcatc 480 

185 gtataccccg accttggggt cagggtgtgc gaaaagatgg ccctctatga catcgcacaa 540 

186 aagcttccca aagcgataat ggggccatcc tatgggttcc aatactctcc cgcagaacgg 600 

187 gtcgatttcc tcctcaaagc ttggggaagt aagaaggacc caatggggtt ctcgtatgac 660 

188 acccgctgct ttgactcaac cgtcacggag agggacataa gaacagaaga atccatatat 720 

189 caggcttgtt ctctgcctca agaagccaga actgtcatac actcgctcac tgagagactt 780 

190 tacgtaggag ggcccatgac aaacagcaaa gggcaatcct gcggctacag gcgttgccgc 840 

191 gcaagcggtg ttttcaccac cagcatgggg aataccatga catgttacat caaagccctt 900 

192 gcagcgtgta aggctgcagg gatcgtggac cctgttatgt tggtgtgtgg agacgacctg 960 

193 gtcgtcatct cagagagcca aggtaacgag gaggacgagc gaaacctgag agctttcacg 1020 

194 gaggctatga ccaggtattc cgcccctccc ggtgaccttc ccagaccgga atatgacttg 1080 

195 gagcttataa catcctgctc ctcaaacgta tcggtagcgc tggactctcg gggtcgccgc 1140 
W--> 196 cggtacttcc taaccagaga ccctaccact ccantcaccc gagctgcttg ggaaacagta 1200 

197 agacactccc ctgtcaattc ttggctgggc aacatcatcc agtacgcccc cacaatctgg 1260 

198 gtccggatgg tcataatgac tcacttcttc tccatactat tggcccagga cactctgaac 1320 

199 caaaatctca attttgagat gtacggggca gtatactcgg tcaatccatt agacctaccg 1380 

200 gccataattg aaaggctaca tgggcttgaa gccttttcac tgcacacata ctctccccac 1440 

201 gaactctcac gggtggcagc aactctcaga aaacttggag cgcctcccct tagagcgtgg 1500 

202 aagagtcggg cgcgtgccgt gagagcttca ctcatcgccc aaggagcgag ggcggccatt 1560 

203 tgtggccgct acctcttcaa ctgggcggtg aaaacaaagc tcaaactcac tccattgccc 1620 

204 gaggcgagcc gcctggattt atccgggtgg ttcaccgtgg gcgccggcgg gggcgacatt 1680 

205 tatcacagcg tgtcgcatgc ccgaccccgc ctattactcc tttgcctact cctacttagc 1740 

206 gtaggagtag gcatcttttt actccccgat cgatga 1776 

208 <210> SEQ ID NO: 3 

209 <211> LENGTH: 1394 

210 <212> TYPE: PRT 

211 <213> ORGANISM: Artificial Sequence 

213 <220> FEATURE: 

214 <223> OTHER INFORMATION: modified NS3-5A 
W--> 216 <221> NAME/KEY: VARIANT 

217 <222> LOCATION: (1215)...(1215) 

218 <223> OTHER INFORMATION: Xaa = asparagine or serine 
W--> 220 <221> VARIANT 

221 <222> LOCATION: (904)...(904) 

222 <223> OTHER INFORMATION: Xaa = valine or alanine 
W--> 224 <400> 3 
RAW SEQUENCE LISTING ERROR SUMMARY 

PATENT APPLICATION: US/10/577,893 

Input Set : A:\21564Y SEQ 05 01 06. TXT 
Output Set: N:\CRF4\05112006\J577893.raw 



DATE: 05/11/2006 
TIME: 11:07:17 



Please Note: 

Use of n and/or Xaa have been detected in the Sequence Listing. Please review the 
Sequence Listing to ensure that a corresponding explanation is presented in the <220> 
to <223> fields of each sequence which presents at least one n or Xaa. 

Seq#:1; Xaa Pos. 5,24 
Seq#:2; N Pos. 3,9,1 
Seq#:3; Xaa Pos. 904,1 
Seq#:4; N Pos. 3644 
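
The positions flagged above can be cross-checked by hand or with a short script before a corrected listing is submitted. The following Python sketch is illustrative only and is not part of the Checker program: the helper names n_positions, xaa_positions and unexplained are hypothetical, and it assumes the sequence text has been copied out of the listing as plain strings. It locates every n or Xaa and reports any position that has no matching <222> LOCATION entry.

```python
import re


def n_positions(sequence_text: str) -> list[int]:
    """Return 1-based positions of 'n' in a nucleotide sequence.

    Whitespace, line numbers and running base counts are ignored.
    """
    bases = re.sub(r"[^a-z]", "", sequence_text.lower())
    return [i for i, base in enumerate(bases, start=1) if base == "n"]


def xaa_positions(residue_text: str) -> list[int]:
    """Return 1-based positions of 'Xaa' in a three-letter protein sequence."""
    residues = re.findall(r"[A-Z][a-z]{2}", residue_text)
    return [i for i, residue in enumerate(residues, start=1) if residue == "Xaa"]


def unexplained(positions: list[int], explained_locations: list[int]) -> list[int]:
    """Return positions that have no corresponding <222> LOCATION entry."""
    return sorted(set(positions) - set(explained_locations))


# First line of SEQ ID NO: 2 as printed in this report.
first_line = "tcnatgtcnt acncntggac nggngccntn atnacaccat gtgggcccga agaggagaag 60"
found = n_positions(first_line)

# <222> locations given for SEQ ID NO: 2 that fall within the first 60 bases.
explained = [3, 9, 13, 15, 21, 24, 28, 30, 33]
print(found, unexplained(found, explained))
```

Any position returned by unexplained would need its own <221> to <223> entries in the corrected listing.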

Use of <220> Feature (NEW RULES): 

Sequence(s) are missing the <220> Feature and associated headings. 

Use of <220> to <223> is MANDATORY if <213> ORGANISM is "Artificial Sequence" 
or "Unknown". Please explain source of genetic material in <220> to <223> 
section (See "Federal Register," 6/01/98, Vol. 63, No. 104, pp. 29631-32) 
(Sec. 1.823 of new Rules) 

Seq#: 1,2,3,4,24 
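
The <220> requirement can be pre-screened in the same way. The sketch below is a rough approximation under stated assumptions, not the Checker's own logic: it assumes the numeric identifiers of one sequence have already been extracted in file order as integers, and the function names features_missing_220 and is_ascending are hypothetical. It flags each <221> that is not directly preceded by its own <220>, and tests whether a run of identifiers is in ascending order.

```python
def features_missing_220(tags: list[int]) -> list[int]:
    """Return the indices of <221> identifiers not directly preceded by <220>."""
    return [
        i
        for i, tag in enumerate(tags)
        if tag == 221 and (i == 0 or tags[i - 1] != 220)
    ]


def is_ascending(block: list[int]) -> bool:
    """Return True if the identifiers in a block never decrease."""
    return all(a <= b for a, b in zip(block, block[1:]))


# Identifier order resembling the start of SEQ ID NO: 1 in this report:
# one <220>, a <223>, then <221>/<222>/<223> groups with no further <220>.
as_filed = [220, 223, 221, 222, 223, 221, 222, 223]
print(features_missing_220(as_filed))  # indices of the <221> entries lacking a <220>
print(is_ascending(as_filed[:3]))      # <223> before <221> is out of order

# The same features with a <220> opening each group.
corrected = [220, 223, 220, 221, 222, 223, 220, 221, 222, 223]
print(features_missing_220(corrected))
```

In this reading, giving each feature its own <220> line clears the missing-tag findings and keeps the identifiers within each group in ascending order.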
Input Set : A:\21564Y SEQ 05 01 06. TXT 
Output Set: N:\CRF4\05112006\J577893.raw 

L:12 M:270 C: Current Application Number differs, Replaced Current Application No 
L:12 M:271 C: Current Filing Date differs, Replaced Current Filing Date 
L:27 M:281 W: Numeric Fields not Ordered, <221> Sort in ascending order! 
L:31 M:258 W: Mandatory Feature missing, <220> Tag not found for SEQ ID#:1 
L:35 M:258 W: Mandatory Feature missing, <220> Tag not found for SEQ ID#:1 
L:39 M:258 W: Mandatory Feature missing, <220> Tag not found for SEQ ID#:1 
L:43 M:258 W: Mandatory Feature missing, <220> Tag not found for SEQ ID#:1 
L:44 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:1 after pos.:0 
L:46 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:1 after pos.:16 

(46) "n" or "Xaa" used, for SEQ ID#:1 after pos.:384 
Numeric Fields not Ordered, <221> Sort in ascending order! 
Mandatory Feature missing 
Mandatory Feature missing 
Mandatory Feature missing 
Mandatory Feature missing 
Mandatory Feature missing 
Mandatory Feature missing 
Mandatory Feature missing 
Mandatory Feature missing 
Mandatory Feature missing 
Mandatory Feature missing 
Mandatory Feature missing 
Mandatory Feature missing 

(46) "n" or "Xaa" used, for SEQ ID#:2 after pos.:0 
(46) "n" or "Xaa" used, for SEQ ID#:2 after pos.:60 
(46) "n" or "Xaa" used, for SEQ ID#:2 after pos.:1140 
Numeric Fields not Ordered, <221> Sort in ascending order! 
Mandatory Feature missing, <220> Tag not found for SEQ ID#:3 
Mandatory Feature missing, <220> Tag not found for SEQ ID#:3 
(46) "n" or "Xaa" used, for SEQ ID#:3 after pos.:896 
(46) "n" or "Xaa" used, for SEQ ID#:3 after pos.:1200 
Numeric Fields not Ordered, <221> Sort in ascending order! 
Mandatory Feature missing, <220> Tag not found for SEQ ID#:4 
Mandatory Feature missing, <220> Tag not found for SEQ ID#:4 
(46) "n" or "Xaa" used, for SEQ ID#:4 after pos.:3600 
Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID#:24 
Mandatory Feature missing, <220> Tag not found for SEQ#:24, <213> 
icial Sequence 

Mandatory Feature missing, <223> Tag not found for SEQ#:24, <213> 
icial Sequence 

Mandatory Feature missing, <223> Blank for SEQ#:24, Line#:705 